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The integral membrane-bound Nrf 1 transcription factor fulfils important functions in maintaining cellular 
homeostasis and organ integrity, but how it is controlled vectorially is unknown. Herein, creative use of 
Gal4-based reporter assays with protease protection assays (GRAPPA), and double fluorescence protease 
protection (dFPP), reveals that the membrane-topogenic vectorial behaviour of Nrf 1 dictates its 
post-translational modification and transactivation activity. Nrfl is integrated within endoplasmic 
reticulum (ER) membranes through its NHBl-associated TM1 in cooperation with other semihydrophobic 
amphipathic regions. The transactivation domains (TADs) of Nrfl, including its Asn/Ser/Thr-rich (NST) 
glycodomain, are transiently translocated into the ER lumen, where it is glycosylated in the presence of 
glucose to become a 120-kDa isoform. Thereafter, the NST-adjoining TADs are partially repartitioned out 
of membranes into the cyto/nucleoplasmic side, where Nrfl is subject to deglycosylation and/or proteolysis 
to generate 95-kDa and 85-kDa isoforms. Therefore, the vectorial process of Nrfl controls its target gene 
expression. 



All organisms living in an oxygenated environment have evolved efficient strategies enabling their con- 
stituent cells to express a series of robust homeostatic genes in order to defend against cellular oxidative 
stress and adapt to the changing surroundings. The expression of antioxidant cytoprotective genes is 
controlled primarily by the cap'n'collar (CNC) family of transcription factors 1,2 . This family comprises the 
Drosophila Cnc protein as the founding member 3,4 , the Caenorhabditis elegans protein skinhead-1 (Skn-1) 5,6 , 
and four vertebrate activators nuclear factor-erythroid 2 (NF-E2) p45 subunit 7-9 , NF-E2 p45-related factor 1 [Nrfl 
(ref. 10), including its long form, called transcription factor 11 (TCF11) 11,12 and its short isoform, designated as 
locus control region-factor 1 (LCR-F1) 13,14 , see Fig. 1], Nrf2 (ref. 15), and Nrf3 (refs. 16,17), as well as two distantly 
related repressors BTB and CNC homolog 1 (Bachl) 18,19 and Bach2 (refs. 20,21). In addition to cytoprotection 
against various oxidants and electrophiles, the CNC family members also regulate expression of a wide range of 
genes that are involved in development, growth, longevity, and cellular defence, metabolism, and repair response 
under either normal homeostatic or pathophysiological conditions. Most of those genes are direct targets of CNC 
family transcription factors, which bind to the antioxidant/electrophile response elements (AREs/EpREs) 1,2 . 

In mammals, Nrfl and Nrf2 represent two principal orthologs of the Cnc and Skn-1 factors, that regulate the 
basal and inducible expression of antioxidant, detoxification and 26S proteosomal component genes 1,22 25 . To 
date, there has been a disproportionate focus on Nrf2 and relatively less is known about the function of Nrfl. In 
fact, global knockout of Nrfl in the mouse leads to embryonic lethality and severe oxidative stress 13,26,27 , indicating 
that loss of Nrfl function cannot be compensated by the presence of Nrf2 in these animals. Specifically, con- 
ditional knockout of Nrfl in the liver, bone and brain results in non-alcoholic steatohepatitis and hepatoma 28,29 , 
reduced bone size 30 and neurodegenerative disorders 31,32 , respectively. By contrast, Nrf2 knockout mice are viable 
with no obvious phenotypes 33 , suggesting it seems to be dispensable. These facts convincingly demonstrate that 
Nrfl, but not Nrf2, is essential for maintaining both organ integrity and cellular (redox, lipid and protein) 
homeostasis. This indicates that Nrfl fulfils a unique and indispensable function, that is distinct from Nrf2, in 
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Domain structures of distinct Nrf1 isoforms 
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Figure 1 | Structural domains of distinct Nrfl isoforms. The N-terminal domain (NTD) contains a TM1 -associated NHB1 sequence that dictates Nrfl 
to target to the endoplasmic reticulum (ER) and enables it to anchor within the membrane. Both CNC and bZIP regions were combined into the NehlL 
(Nrf2-ECH homology 1-like) domain that enables heterodimerization with a small Maf or other bZIP proteins before binding to antioxidant response 
element (ARE) sequences in the promoter regions of target genes. The transactivation domains (TADs) of Nrfl comprise AD1 (acidic domain 1), NST 
(Asn/Ser/Thr-rich), AD2 and SR (serine-repeat) regions. On the C-terminal border of NST glycodomain immediately to the acidic-hydrophobic 
amphipathic AD2, the TMi segment may act as a luminal-anchoring switch to control repartitioning of its flanking AD1 and AD2/SR domains. Within 
AD 1 , Neh2L contains the DIDLID/DLG element and ETGE motif; both are conserved with equivalents of Nrf2 and CncC. In addition to Neh2L, the 
Neh5L, but not Neh4L, subdomain is included within AD 1 of Nrfl , when compared with its longer TCF 1 1 . The N-linked glycosylation of the NST domain 
confers Nrfl and TCF11 to migrate electrophoretically at estimated masses of ~120-kDa and —140-kDa, respectively, whilst their non-glycosylated/ 
deglycosylated proteins exhibit fast electrophoretic mobilities at ~95-kDa and ~ 1 10-kDa. They may be further processed through selective proteolysis to 
give rise to multiple cleaved forms of between 85-kDa and 25-kDa. Of which the putative activated 85-kDa protein can be also translated through an 
alternative initiation signal (in Nrfl A767 clones designated originally 51,52 ) so that it lacks an ER-anchoring NTD (thus called NrflAN), and maybe further 
processed into an unstable polypeptide of 65-kDa or 55-kDa. The 55-kDa form, that was originally designated LCR-F1 (called Nrfl (3 herein), is produced 
by in-frame translation and/or selective proteolysis, and may be rapidly degraded to yield short dominant-negative isoforms of 46-kDa, 36-kDa (called 
Nrfly) and 25-kDa (called Nrfl8). For convenience, Nrfl variant AD and AlO clones 51,52 are renamed as NrflD or NrflAS, respectively. The triangle 
represents a loss of the corresponding region by alternative splicing, whereas the scissors with a question mark (?) indicate unidentified cleavage sites. 



controlling a subset of ARE-battery genes that are responsible for 
cytoprotection against stress, apoptosis, degeneration, ageing, 
inflammation and carcinogenesis. 

Amongst the CNC family members, Nrfl is an integral 
membrane-bound glycoprotein with several distinct isoforms 
(Fig. 1). The full-length Nrfl is first synthesized as a 95-kDa 



non-glycosylated protein [i.e. the term non-glycosylation is defined 
that the primordial portion of Nrfl residing on the cytoplasmic side 
of membranes has actually never been glycosylated at the consensus 
asparagines (Asn-X-Ser/Thr, in which X represents any of other 
amino acids rather than proline) in its Asn/Ser/Thr-rich (NST) 
domain, because it is clear that the N-linked glycosylation reaction 
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catalyzed by oligosaccharyltransferases (OST) occurs in the lumen of 
the endoplasmic reticulum (ER) 34,35 ]. Subsequently, the non-glyco- 
sylated Nrfl protein is co-translationally targeted to the ER through 
its N-terminal homology box 1 (NHB1) signal sequence 36 38 . The 
NHB1 -associated transmembrane region (called TM1, aa 7-26) 
determines the membrane-topology of Nrfl (refs. 38-40), which is 
highly conserved with those of TCF1 1 (ref. 22), Nrf3 (ref. 41), CncC 24 
and Skn-1 (ref. 42); they are comprised of an NHB1-CNC subfamily 
of membrane-bound transcription factors, with similar topologies 
being predicted (Figs. SI and S2). Once Nrfl is anchored within 
the ER membrane through its TM1 region, its NST glycodomain 
(aa 299-400) is translocated into the lumen of this organelle, where 
the NST domain is asparagine-glycosylated to yield an inactive 120- 
kDa glycoprotein. Thereafter, Nrfl is sorted out of the ER through the 
outer nuclear envelope membrane and into the inner nuclear mem- 
brane 39 . When required, the luminal regions of Nrfl is partially repar- 
titioned out of membranes and retrotranslocated into the cyto/ 
nucleoplasmic subcellular compartments, whereupon the protein is 
allowed for N-linked deglycosylation by a peptide:N-glycosidase 
(PNGase) 43 46 (i.e. the term deglycosylation), in order to generate an 
active 95-kDa transcription factor. In the vectorial process, Nrfl may 
also be selectively subjected to its proteolytic processing to give rise to 
either a putatively cleaved 85-kDa activator or other short dominant- 
negative isoforms, so that they are released from the ER to translocate 
the nucleus 39,47 49 . However, it is notable that the positively-staining 
signal for Nrfl is neither observed in the mitochondria nor the Golgi 
apparatus under confocal and immuno-electron microscopes 36,38,39 . 

Once the 95-kDa and/or 85-kDa Nrfl reach the nucleoplasmic 
compartments, they can gain access to the general transcriptional 
machinery (GTM) primarily via its Nrf2-ECH homology 5-like sub- 
domain (Neh5L, aa 280-298) 39,50 , allowing transactivation of target 
genes. By contrast, LCR-F 1 (also called Nrfl p, functioning as a weak 
activator 14,39,51 ), along with other dominant-negative isoforms Nrfly 
and NrflS 14,39,51,52 (see Fig. 1), are located predominantly in the nuc- 
leus 39,47 . Therein, the DNA-binding of all Nrfl isoforms to ARE 
sequences in the target gene promoters is exerted through its 
CNC-bZIP domain heterodimerizing with a small Maf protein or 
other bZIP factors 2,53,54 . The ability of Nrfl to activate ARE-driven 
genes is differentially regulated by several distinct transactivation 
domains (TADs), that comprise its acidic domain 1 (AD1, that com- 
prises aa 125-298 covering the PEST1 sequence, and the Neh2L and 
Neh5L subdomains), NST glycodomain (aa 299-400), AD2 (aa 403- 
453), and serine-repeat domain (SR, aa 454-489) 14,39 . Intriguingly, 
the TMi glycopeptide (i.e. a putative intermediate transmembrane - 
associated helix formed by aa 374-393, see Fig. SI) is located in the 
NST C-terminal border immediately to the acidic-hydrophobic 
amphipathic region within AD2 (refs. 22,39), whereas the TMp hinge 
region (i.e. a Pro-kinked transmembrane helix that is predicted to be 
formed by aa 507-525, see Fig. SI) is located in the N-terminal 
border of the Neh6L domain (aa 490-580), which is flanked N- 
terminally by the acidic TADs (aa 125-489) and C-terminaily by 
the basic CNC-bZIP domain (aa 581-687). However, the molecular 
mechanism by which the transactivation activity of Nrfl is tempo- 
spatially controlled in its membrane-topogenic processes remains 
elusive. Herein, our novel evidence reveals distinct topogenic- 
vectorial processes whereby some of Nrfl isoforms are translocated 
into the ER and/or retrotranslocated across the membranes into the 
cyto/nucleoplasmic side, allowing transactivation of its target genes. 

It is notable that our previous work showed that inhibition of N- 
linked glycosylation by tunicamycin (TU) results in the wild-type 
Nrfl being expressed as a non-glycosylated 95-kDa protein, with a 
transactivation activity that is 25% lower than that of wild-type Nrfl 
treated with vehicle control 39 . By contrast, the PNGase-based 
deglycosylation inhibitor Z-VAD-fmk [carbobenzoxy-valyl-alanyl- 
aspartyl-(0-methyl)-fluoromethyl ketone] 55 causes an increase in 
the abundance of the inactive 120-kDa Nrfl glycoprotein, as 



accompanied by a partial decrease in its transactivation activity 
(unpublished data). Further, the Nrfl (1 - 7>xN/D mutant (in which all 
seven asparagine glycosylation consensus sites in the NST domain 
were mutated into aspartate residues in order to create a potent 
activator, representing deglycosylated 95-kDa Nrfl protein) med- 
iates an increased transactivation activity of ARE-driven gene 
expression, whereas the Nrfl (1 ~ 7)xN/Q mutant (in which all seven 
asparagine consensus sites in the NST domain were mutated into 
glutamine residues so as to create a non-glycosylated 95-kDa pro- 
tein) exhibits a modestly decreased activity 39 , when compared with 
that of the wild-type Nrfl factor. These findings demonstrate that the 
transactivation activity of Nrfl is, at least in part, regulated by the 
changing glycosylation status of its NST domain situated within 
the centre of acidic TADs. Such asparagines-linked glycosylation of 
Nrfl, as other membrane proteins characterized by Goder and 
Spiess 56 , could influence (or be influenced by) its dynamic topo- 
logical orientation within and around membranes. These have led 
us to envisage that glucose levels might alter the glycosylation/degly- 
cosylation status of Nrfl, and in turn, these modifications could be 
controlled by membrane-topogenic vectorial behaviours of this 
NHB 1 -CNC protein that dynamically moves in and out of ER mem- 
branes during its topological folding or post-translational proces- 
sing. As expected, our present study demonstrates that distinct 
vectorial processes of Nrfl are dictated by glucose consumption 
and deprivation. Importantly, the membrane-topogenic vectorial 
behaviour of Nrfl controls its post-translational modification and 
its ability to transactivate its target gene expression. 

Results 

Distinct Nrfl isoforms are co-expressed in different mammalian 
species. Accumulating evidence reveals that over eleven Nrfl iso- 
forms (Fig. 1) are produced from the single nfe2ll gene, though 
differentially expressed, in different mammalian species 1,2,51,52,57 . 
These isoforms are synthesized by translation through distinct ini- 
tiation signals (i.e. the first or internal start ATG codons) embedded 
in different lengths of open reading frames, portions of which can be 
alternatively spliced from the cognate mRNAs 10-14,51-53,58 . The 
prototypic full-length Nrfl protein arises by alternative splicing of 
the mRNA enabling translation of the long TCF1 1 form 12,58 , which is 
expressed in the human and rat, but not in the mouse 10,12,14 . Although 
removal of the Neh4L-encoding nucleotides results in a loss of the 
putative TAD element in Nrfl, this factor was shown to have a 
similar ability to transactivate ARE-driven genes as TCF11 (with a 
molecular mass of approximately 140-kDa estimated on Laemmli 
SDS-PAGE gels) 59 . Both the full-length Nrfl and TCF11 forms can 
also be processed by selective proteolysis to give rise to the putative 
activated forms of between about 85-kDa and 65-kDa [which lack the 
ER-anchoring N-terminal domain (NTD, aa 1-124, that negatively 
regulates Nrf 1 ), but retain either the entire or the essential portions of 
TADs]. They may also be further processed to generate various 
TADs-deficient isoforms such as those of approximately 55-kDa, 
36-kDa and 25-kDa (designated NrflpVLCR-Fl, Nrfly and NrflS, 
respectively) (Fig. 1; see refs. 11,39,60). These truncated variants are 
neither targeted to the ER nor recovered in membrane fractions 38,39 
(also see Figs. 2e, and S5). 

The LCR-Fl/Nrfip polypeptide can also arise by internal trans- 
lation starting between Met 289 and Met 297 codons within the longer 
mRNA transcripts 14,53 . It migrates at a mass of 55-kDa in the pH 7.0 
LDS/NuPAGE gel (Fig. 2; see refs. 38,39), but the pH 8.9 Laemmli 
SDS-PAGE allows its mobility to be exhibited at an estimated size of 
65-kDa 59,61 (which was called p65Nrfl in ref. 61). By comparison 
with the full-length Nrfl, the 55-kDa LCR-Fl/Nrfl P form lacks both 
the NTD and the essential transctivation domain AD1, but still con- 
tains AD2, NST and SR domains that are also required for full activ- 
ity. It is thus postulated that LCR-Fl/Nrfl P is a soluble nuclear 
activator with a poor transactivation activity; this has been confirmed 
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Figure 2 | The stability of longer Nrfl isoforms in response to changes in glucose concentration, (a) Examination of the stability of Nrfl isoforms. COS- 
1 cells were transfected with an Nrfl expression construct for 6 h, and were allowed to recover from transfection for 16 h in the fresh 25 mM-glucose 
medium. The cells were treated with cycloheximide (CHX, 50 ug/ml) and/or MG132 (5 uM) for the indicated times before being disrupted. The 
abundance of Nrfl proteins was determined by immunoblotting. The left upper shows a cropped image from the film that was exposed to the immunoblot 
for a shorter time to eliminate intense bands. The intensity of blots was calculated by dividing the values for Nrfl with that for P-actin. The relative amount 
of the 120-, 95- and 85-kDa proteins at the indicated times ( [A],) was normalized to [A] 0 (i.e. the value of 120-kDa at t 0 after CHX), and is shown 
graphically (right, mean ± S.D, n = 4), along with their half-lives (t 1/2 )- (b,c) Stability of Nrfl isoforms in cells grown in 25 mM or 5.5 mM glucose 
medium upon blockage of glycosylation by tunicamycin (TU). Nrfl -expressing cells were transferred to 25 mM (b) or 5.5 mM (c) glucose medium, to 
which was added TU ( 1 ug/ml) and/or MG132 (5 uM) for the indicated times. Following western blotting, the relative abundance of the 120-, 95- and 85- 
kDa Nrfl proteins was calculated after normalization to [A] 0 of 120-kDa at t 0 after TU and shown graphically [right, mean ± S.D, n = 4). (d,e) Thenon- 
glycosylated 95-kDa Nrfl isoform is not protected by membranes. Nrfl-expressing cells were treated with TU ( 1 ug/ml) and/or ALLN (5 ug/ml) for 18 h 
in the fresh 25 mM-glucose medium, and then subjected to subcellular fractionation (e), followed immediately by membrane proteinase protection 
assays (d). The resulting products were identified by immunoblotting with V5 antibody. Calreticulin (CRT) was an ER-luminal marker, that existed in 
ER-enriched subcellular fractions purified by centrifuging at 39, 000 X g(39 K) and 100, 000 X g(100 K). Sec61 a and Lamin-a were used to verify ER and 
nuclear envelope (NE) membranes. 
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by us and others 14,35,51,52 . An exception was made by additional obser- 
vations that LCR-Fl/Nrfl(3 was thought to be a significant dom- 
inant-negative inhibitor of ARE-driven gene transcription 
mediated by the full-length Nrfl and/or Nrf2 factors 61 . The disparity 
suggests a possibility that LCR-Fl/Nrfip is unstable so that it is 
rapidly degraded to yield those lower molecular weight isoforms of 
between 46-kDa and 25-kDa (Fig. 2; see refs. 11,39,60). In fact, the 
36-kDa Nrfly and the 25-kDa NrflS (both arise from in-frame 
translation and selective proteolysis) were shown to be a bona fide 
dominant-negative form that lacks all the potential transactivation 
domains AD1, AD2, NST and/or SR regions 11,39,51 (also see Fig. S5C). 
This is based on the fact that when they are over-expressed, these 
small isoforms can competitively interfere with the assembly and 
function of the active transcription factor complex (i.e. Nrfl or 
Nrf2), thereby inhibiting induction of target genes. As expected, 
production of both the 55-kDa Nrfl [3 activator and the dominant- 
negative 36-kDa Nrfly form (Fig. S5D), which are localized in the 
nucleus (Fig. S5B), was blocked by the N r fr M2S9 - 297L mutant (in which 
the internal translation start codons at Met 289 , Met 292 , Met 294 and 
Met 297 were mutated into leucine residues), resulting in an increase 
in the transcativation activity of P sv40 nqol-ARE-\uc reporter gene by 
approximately 50% more than the activity mediated by the wild- type 
Nrfl factor (Fig. S5C). In addition, generation of the dominant- 
negative 25-kDa Nrfl S form was prevented by the Nrfl p M548L mutant 
(in which another potential in-frame translation start codon at 
Met 548 of Nrfl was mutated into leucine, Fig. S5D), which exhibited 
an increased activity when compared with the wild-type Nrfl P factor 
(Fig. S5C). 

Additional two splice variants were originally designated as Nrfl 
clones A767 and AD 51,52 , which contain a deletion of the functional 
translational start and stop codons, respectively. Removal of the first 
translation start codon results in the generation of a small form 
(called NrflAN in Fig. 1), in which the first N-terminal 181-aa 
region, covering the entire NTD and an one-third portion of the 
AD1, is substituted with a dodecapeptide MGWESRLTAASA 
(GenBank accession No. NM_001 130453.1). For this reason, we 
postulate that NrflAN represents a constitutively activated form of 
Nrfl (ref. 2). By contrast, the Nrfl splice variant AD has a deletion 
of the translation stop codon, but gains a change in the second half of 
the leucine zipper motif (GenBank accession No. AF071084.1) 52 . 
Within this variant, the C-terminal 72 aa of the wild-type Nrfl, that 
are highly positively charged, are exchanged for an alternative 80-aa 
stretch that is enriched with negatively charged residues and contains 
a putative integral transmembrane (TM) segment situated at the C- 
terminal end. Moreover, the third Nrfl variant (originally called 
clone A10) possesses a 137-aa deletion covering the SR and Neh6L 
regions 51 . For convenience, Nrfl variant clones AD and A10 are 
renamed as NrflD or NrflAS, respectively (Fig. 1), but whether they 
have a capability similar to that for the wild-type Nrfl to be anchored 
within the ER membrane is not identified. 

Interestingly, the three long ectopic 120-, 95- and 85-kDa Nrfl 
isoforms are forced to express at an approximate ratio of 1.0 : 0.6 : 0.3 
in COS- 1 cells that had been transfected with an expression plasmid 
for wild-type Nrfl (Fig. 2a). The major ER-resident form of Nrfl is 
the 120-kDa glycoprotein with a dynamic membrane behaviour, 
which is markedly distinct from the minor 95- and 85-kDa nongly- 
cosylated and/or deglycosylated isoforms. Both of the 95- and 85- 
kDa Nrfl are rapidly digested by proteinase K (PK) in membrane 
protection reactions 39,62 , indicating that they are positioned on the 
cyto/nucleoplasmic sides of membranes. In fact, the 95-kDa, 85-kDa 
and other short isoforms are recovered predominantly in the soluble 
nuclear and/or cytosolic subcellular fractions 39 . These observations 
led us to investigate the order of events whereby these three Nrfl 
isoforms are translocated into the ER lumen and/or retrotranslocated 
across the membranes, and to consider how the process is dictated by 
glucose consumption and deprivation. 



The stability of Nrfl isoforms arising from its post-translational 
modifications in vectorial processes. To elucidate the order of 
events by which Nrfl isoforms arise from distinct post-trans- 
lational modifications (i.e. glycosylation, deglycosylation and/or 
proteolysis) in response to changes in the concentration of glucose, 
we have performed a series of experiments in the presence or 
deficiency of glucose in the media, to which added are inhibitors of 
N-linked glycosylation, along with proteasome inhibitors. This 
facilitates to assess what impact glucose levels had on the 
appearance of specific Nrfl isoforms. 

Firstly, to determine the stability of 120-, 95- and 85-kDa Nrfl 
isoforms, cells expressing wild-type Nrfl were grown in medium 
containing 25 mM glucose (control), and treated with cycloheximide 
(CHX, that inhibits protein biosynthesis) alone or together with 
proteasome inhibitors MG132. Western blotting showed that the 
abundance of all three longer Nrfl proteins was significantly 
decreased following 2 h-treatment with CHX, whilst co-treatment 
of cells with MG132 prolonged the turnover of 120-kDa Nrfl gly- 
coprotein over 4 h, but it markedly increased the abundance of the 
95-kDa and 85-kDa protein until 8 h after treatment (Fig. 2a, left). 
The stoichiometric graph (right) showed a tendency to reveal a 
marked difference in the turnover of the 120-, 95- and 85-kDa pro- 
teins that was determined with their half- lives estimated to be 1 5 1 , 93 
and 27 min, respectively, after treatment of cells with CHX. Upon 
addition of MG132 the stabilities of 120-, 95- and 85-kDa Nrfl were 
substantially increased (left), giving estimated half-lives of 342, 475 
and 885 min (right graph), respectively. The discrepancy between the 
upward trends in their stabilities suggest that the extra-luminal 85- 
kDa Nrfl protein is rapidly degraded primarily through the 26S 
proteasome-mediated proteolysis pathway, whilst most of the 120- 
kDa glycoprotein is protected by the intact membranes against pro- 
teolytic degradation by 26S proteasomes, and a portion of the 95-kDa 
protein may also be partially protected by membranes during its 
topogenesis. In addition, treatment with the calpain inhibitor-I 
[CI, also called ALLN (N-Acetyl-Leucyl-Leucyl-Norleucine)] 
showed that non-proteasome-mediated pathways may also contrib- 
ute to the proteolytic processing of 120-kDa and 95-kDa Nrfl 
(ref. 39). 

Secondly, to determine the effect that inhibition of Nrfl glycosyla- 
tion has on both its protein stability and its post-translational modi- 
fication, cells expressing wild-type Nrfl were allowed to be grown in 
medium containing either 25 mM glucose (control) or medium con- 
taining 5.5 mM glucose (restriction), and treated with TU (an inhib- 
itor of oligosaccharyltransferases) to block N-linked glycosylation of 
newly-synthesized proteins, or plus MG132 or ALLN to block pro- 
teolysis. As shown in Fig. 2b, treatment with TU caused a time- 
dependent block in the appearance of the 120-kDa Nrfl glycoprotein 
(left), which was reduced to 50% by 325 min estimated (right graph), 
and thereafter it gradually disappeared. Treatment of cells exposed to 
TU with MG132 (or ALLN in Fig. 2d) did not apparently increase the 
amount of the 120-kDa Nrfl glycoprotein (Fig. 2b, left), though these 
chemicals modestly slowed its turnover so that its half-life time was 
prolonged from about 325 min to 502 min (right graph). After 2 h 
treatment of cells with TU, the disappearance of the 120-kDa Nrfl 
glycoprotein was accompanied by an obvious increase in the abund- 
ance of the major 95-kDa Nrfl protein (Fig. 2b, left), which is thus 
assumed to comprise both deglycosylated and non-glycosylated 95- 
kDa proteins. Thereafter, expression of the major 95-kDa Nrfl 
(along with the minor 85-kDa Nrfl, left middle) proteins returned 
to low levels at 8 h after TU treatment, but could be augmented by 
treatment with MG132. These data demonstrate that the full-length 
Nrfl protein is glycosylated to become the 120-kDa glycoprotein 
before it is deglycosylated to yield the 95-kDa isoform (i.e. this pro- 
cess is termed deglycosylation). Under certain circumstances, the 95- 
kDa isoform can also arise from the failure to glycosylate Nrfl after 
biosynthesis (i.e. termed non-glycosylated 95-kDa Nrfl protein), as 
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was apparent during the prolonged exposure to TU, particularly in 
the presence of MG132 (Fig. 2b, left). These observations suggest that 
the rapid degradation of the deglycosylated/non-glycosylated 95- 
kDa Nrfl protein occurs predominantly through 26S proteasome- 
mediated proteolysis pathway, whilst most of the luminal-resident 
120-kDa glycoprotein maybe subject to a non-proteasome-mediated 
degradation pathway (e.g. Calpain). Furthermore, a small fraction of 
the non-glycosylated 95-kDa Nrfl protein, but neither of the 120- 
kDa glycoprotein or the cleaved 85-kDa isoform, was expressed in 
COS-1 cells that had been incubated with TU for 18 h (Fig. 2e, left 5 
lanes). By contrast, co-treatment of cells with TU and ALLN caused a 
marked increase in the abundance of the non-glycosylated 95-kDa 
Nrfl protein (Fig. 2e, right 5 lanes). Although the non-glycosylated 
95-kDa Nrfl protein was recovered in either ER or NE membranes, 
rather than the cytosolic fraction, purified from TU-treated cells 
(Fig. 2e), it was neither protected by membranes against PK digestion 
(Fig. 2d), nor did it exhibit potent transcriptional activity (see ref. 39). 
These results suggest that the non-glycosylated 95-kDa Nrfl protein 
may be mislocated and/or misfolded before it is then degraded by 
proteasomes. 

Thirdly, changes in the glycosylation status of Nrfl were exam- 
ined upon restriction of glucose (i.e. cells allowed to be grown in 
medium containing 5.5 mM glucose following transfer from med- 
ium containing 25 mM glucose). After synthesis of Nrfl glycopro- 
tein was blocked by addition of TU, glucose restriction resulted in 
a significant loss of the 120-kDa glycoprotein band (Fig. 2c, left), 
with an estimated half-life of 209 min in the absence of MG132, or 
279 min in the presence of MG132, respectively (right graph). The 
expression of the 95- and 85-kDa Nrfl proteins was maintained at 
low levels, but their abundances were markedly increased upon 
addition of MG132. These data suggest that production of the 
120-kDa Nrfl glycoprotein is dependent on the presence of glu- 
cose, whereas a portion of the 95- and/or 85-kDa proteins may be 
derived from the 120-kDa glycoprotein. 

Vectorial processes by which Nrfl isoforms arise from distinct 
post-translational modifications in the intracellular response to 
changes in glucose supply. To elucidate the membrane-topogenic 
vectorial process of Nrfl in which the protein is glycosylated and/or 
deglycosylated in response to changes in glucose supply, cells 
expressing wild-type Nrfl were allowed to be starved for 16 h in 
medium containing 1 mM glucose, and then transferred to fresh 
control medium containing 25 mM glucose to which was added 
DMSO vehicle, TU alone, or TU plus MG132 for various periods 
of time. As shown in Figure 3, the result reveals a time-dependent 
process in which Nrfl is glycosylated in the ER lumen and thereafter 
is deglycosylated in the extra-luminal compartments in response to 
acute glucose addition (i.e. cells expressing wild-type Nrfl were 
allowed to recover overnight in medium containing 1 mM glucose 
before they were placed in medium containing 25 mM glucose). In 
this glucose addition experiment, the 95-kDa Nrfl protein was the 
most abundant isoform expressed in glucose-starved cells that had 
been grown in medium containing 1 mM glucose. However, upon 
exposure to 25 mM glucose a gradual increase in the 120- and 95- 
kDa proteins was observed giving similar levels 2 to 4 h after 
exposure to 25 mM glucose as was seen in control cells, and 
thereafter, the 120-kDa glycoprotein became dominant (Fig. 3a, 
left 6 lanes). After 2 h treatment with TU, the 120-kDa Nrfl band 
disappeared and then was replaced by the enhanced 95-kDa protein 
(Fig. 3b, right 6 lines). The rapid turnover of newly- synthesized 120- 
kDa Nrfl after 60 min was not prevented by treatment with MG132 
because the proteasome inhibitor did not obviously elevated amount 
of the nascent 120-kDa protein (Fig. 3c right 6 lines). These data are 
consistent with the hypothesis that Nrfl is first synthesized as a 95- 
kDa non-glycosylated form, targeted to the ER, and then translocated 
into the lumen of this organelle, where it is glycosylated to become 



the 120-kDa isoform. By contrast, amount of the major 95-kDa Nrfl 
proteins was significantly augmented by MG132, as accompanied by 
a certain increase in expression of the 85-kDa Nrfl and other short 
isoforms (Fig. 3c, right 6 lines). This result suggests that when Nrfl 
failed to be glycosylated, its 95-kDa non-glycosylated protein is 
rapidly processed by proteasome-mediated proteolysis to generate 
multiple polypeptides of between 85-kDa and 25-kDa. 

Next, to elucidate the topogenic vectorial processes whereby Nrfl 
is glycosylated and/or deglycosylated in response to glucose-free 
conditions, cells expressing wild-type Nrfl were allowed to be grown 
for 16 h in medium containing 25 mM glucose and then starved for 
the indicated times in glucose-free medium that contained DMSO 
vehicle, Z- VAD-fmk (which is abbreviated to ZVF, an inhibitor of N- 
linked deglycosylation) alone or plus MG132 (to block the protea- 
somal degradation). As shown in Figure 3a (right 6 lanes), in cells that 
had been grown in medium containing 25 mm glucose, complete 
glucose deprivation triggered a time-dependent loss of the 120-kDa 
Nrfl glycoprotein. Its disappearance was also accompanied by a 
modest decrease in the abundance of 95- and/or 85-kDa proteins 
by 4 h after glucose deprivation, and their decreased levels remained 
until 8 h after glucose deprivation (c.f, Fig. 3a, right 6 lanes with 
Fig. 4b, left 6 lanes). Inhibition of N-linked deglycosylation by Z- 
VAD-fmk had little impact on the turnover of the 120-kDa Nrfl 
glycoprotein (Fig. 4b), changing its half-life from 227 min to 
262 min (Fig. 4a), or 268 min in the presence of MG132) (Fig. 4b, 
right 6 lanes). The data suggest that the 120-kDa Nrfl is protected by 
membranes against proteasome-mediated degradation and this gly- 
coprotein turnover is thus assumed to occur through a proteasome- 
independent mechanism, particularly before it is allowed to be 
deglycosylated. Interestingly, the inhibition of glucose deprivation- 
stimulated deglycosylation of 120-kDa Nrfl by Z- VAD-fmk also 
caused a marked decrease in the levels of 95- and 85-kDa proteins 
(Fig. 4b, left 6 lanes) to 50% of their original levels by 270 min and 
43 min estimated respectively (Fig. 4a), suggesting that deglycosyla- 
tion of 120-kDa Nrfl glycoprotein may proceed with generation of 
the 95- and 85-kDa isoforms. Inhibition of the proteasome by 
MG132 rendered a time-dependent increase in expression of the 
95- and 85-kDa Nrfl proteins (Fig. 4b, right 6 lanes), so that their 
half-lives were prolonged to 420 min and 365 min, respectively 
(right graph). Collectively, these results suggest that a major fraction 
of the 95-kDa isoform arises from deglycosylation of the 120-kDa 
Nrfl glycoprotein after it has been transferred from the ER lumen to 
extra-luminal cyto/nucleoplasmic compartments, whereupon the 
95-kDa Nrfl protein is proteolytically processed by 26S proteasomes 
to yield the 85-kDa isoform. Thereafter, the 95-kDa and/or 85-kDa 
Nrfl proteins are subject to further proteolytic degradation to yield 
several short isoforms of between 55 kDa and 25 kDa, but detailed 
mechanisms require further study. 

The changing topology of Nrfl within and around membranes is 
determined by its TM1 in cooperation with other semihydro- 
phobic amphipathic regions. Our previous work has shown that 
the NHB1 -associated TM1 sequence (aa 7-26) anchors Nrfl with- 
in ER membranes and is itself orientated in an N cyt /Ci um fashion 
within the membrane 2,38 " 40 . To gain an insight into the membrane- 
topological folding of Nrfl within the ER, we created several double 
fluorescence chimaeric proteins, in which Nrfl and its truncation 
mutants are either sandwiched in between DsRed and GFP or fused 
C-terminally with GFP alone (Fig. 5a). These sandwiched fusion pro- 
teins were subjected to classic proteinase protection assay; this proce- 
dure, called double fluorescence protease protection (dFPP), was 
established on the base of the fluorescence protease protection 
(FPP) assay to identify the membrane-topology of GFP chime- 
ras 63-64 . As shown in Figure 5b, membrane protease protection 
reactions of intact ER-enriched fractions from cells expressing the 
expression construct for DsRed/N65/GFP (in which N65 represents 
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Figure 3 | Post-translational processing of Nrfl into longer isoforms in response to changes in glucose concentration, (a) Glycosylation and 
deglycosylation of Nrfl in response to glucose supply or glucose starvation. Following transfection with the wild-type Nrfl expression construct, a group 
of COS-1 cells {left 6 lanes) was allowed to starve in 1 mM-glucose medium for 16 h, before they were transferred to fresh 25 mM-glucose medium (i.e. 
glucose supply) . Another group of the cells (right 6 lanes) was left to recover for 16 h in 25 mM-glucose medium, before transfer to glucose-free medium 
(i.e. glucose deprivation). Thereafter, these two groups of cells were disrupted at the indicated times after glucose addition or glucose deprivation, and 
then examined by western blotting. The left upperpanels show two cropped images from the film that was exposed to the immunoblot for different shorter 
times to eliminate intense bands. The right graph shows the relative abundance of the 120-, 95- and 85-kDa proteins (mean ± S.D, n = 4) that were 
calculated after normalization to [A] 0 of 120-kDa at t 0 after transfer 25 mM-glucose medium to the starved cells that had been grown in glucose-free 
medium. (b,c) The vectorial process by which Nrfl is post-translationally modified was determined by addition to the 1 mM-glucose starved cells of 
25 mM-glucose in the presence of DMSO vehicle, TU alone, or TU plus MG132. After transfection, Nrfl -expressing cells were starved in 1 mM- glucose 
medium for 16 h before transfer to 25 mM-glucose media containing DMSO, TU (1 ug/ml), or plus MG132 (5 uM) for the indicated times. The relative 
abundance of the 120-, 95- and 85-kDa Nrfl proteins was determined by western blotting and calculated as described above (right, mean ± S.D, n = 4). 
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Figure 4 | Post-translational processing of Nrf 1 into longer isoforms in 
response to glucose deprivation. The vectorial process by which Nrfl is 
post-translationally modified was determined by the treatment of cells that 
is subject to glucose deprivation along with Z-VAD-fmk (which blocks re- 
linked deglycosylation of glycoprotein). Following transfection, Nrfl- 
expressing cells were allowed to recover from transfection in the fresh 
25 mM-glucose medium for 16 h, before transfer to fresh glucose-free 
medium containing DMSO vehicle, Z-VAD-fmk (ZVF, 20 uM) alone, or 
ZVF plus MG132 (5 uM) for the indicated times. The abundance of the 
120-, 95- and 85-kDa Nrfl proteins was determined by western blotting 
(b, left), as described above in Fig. 3a (right 6 lanes). The relative intensity of 
immunoblots for these three longer isoforms was calculated 
(mean ± S.D, n = 4), and is shown graphically (a and b right), with 
distinct trends in their half-lives ( t t / 2 ) estimated. 

the N-terminal 65 aa of Nrfl, containing the TMl-embedded NHB1 
sequence), followed by immunoblotting with antibodies against 
either DsRed or GFP, revealed that the full-length DsRed/N65/ 
GFP sandwiched protein was integrated into the membranes (right 
cartoon). The DsRed epitope was almost completely degraded by PK 
(Fig. 5bl). The resulting loss of DsRed from DsRed/N65/GFP gave 
rise to a major PK-digested fragment of approximately 55-kDa with 
the GFP epitope facing the lumen; but this fragment appeared to be 
evidently protected by membranes against further digestion by PK, 
even in the presence of 1% Triton X-100 (TX) (Fig. 5b2). The data 
demonstrate that the TM1 region of N65 is orientated in an N cyt /Q um 
fashion spanning ER membranes, while the water-soluble DsRed and 
GFP portions are positioned on the cytoplasmic and luminal sides of 
membranes, respectively (Fig. 5b, right cartoon). Further membrane 
PK protection assays of the N65/GFP fusion protein revealed that its 
membrane-topology appeared similar to that of the sandwiched 
DsRed/N65/GFP protein (Fig. 5c, right cartoon), indicating that 
the N cyt /Ci um orientation of TM1 in the N65 peptide appears to be 
unaffected by attachment of the DsRed epitope immediately to the 
front of the TMl-associated NHB1 sequence. It is important to note 
that, although the local membrane-topology of the NHB1 sequence 



in Nrfl appears to resemble those of ATF6 and SREBP-1 (Fig. S2), it 
has neither Site-1 nor Site-2 protease cleavage sites (it is notable that 
both proteases are exclusively located in the Golgi apparatus rather 
than the ER 65 67 ). It is therefore assumed that the NHB1 sequence of 
Nrfl is not processed in the ER 38 through a mechanism accounting 
for the successively proteolytic processing of ATF6 and SREBP-1 
(refs. 65-67). 

To learn more about the overall membrane-topological folding of 
Nrfl with the ER, the full-length Nrfl protein was sandwiched in 
between DsRed and GFP to yield DsRed/Nrfl/GFP. The membrane 
PK protection assays showed that the N-terminal DsRed portion is 
retained on the cytoplasmic side of membranes where it is not pro- 
tected so that it was almost completely digested within 30 min by PK 
(Fig. 5dl). By contrast, a luminal 32-kDa peptide, with its C-terminal 
end being tagged by GFP, was protected by ER membranes from 
digestion of DsRed/Nrfl/GFP by PK (50 ug/ml), even in the pres- 
ence of TX (Fig. 5d3, left). Further processing of the 32-kDa GFP- 
containing peptide by 100 Ug/ml of PK gave rise to two smaller 
fragments of 28-kDa (free GFP) and 31-kDa (containing most of 
TMc fused with GFP). Interestingly, none of these three GFP- 
containing peptides of between 28 kDa and 32 kDa with its C- 
terminus facing the lumen were observed in the membrane PK 
protection assay of DsRed/Nrfl AC /GFP [in which the Nrfl AC mutant 
lacks the C-terminal 56 amino acids of Nrfl including a net 
negatively charged amphipathic TMc peptide (Fig. SI), that con- 
tains a potential membrane-spanning helix-helix interaction motif 
AxxxAGxxxS, of the type described previously 60 ] (Fig. 5d3, right). 
These observations had led us to postulate that during topogenesis of 
Nrfl, if its TMc peptide is integrated through association with mem- 
branes in an N cyt /Ci um orientation similar to that of TM1, a flexible 
hinge region located between the TM1 and TMc segments should be 
allowed to orientate the intervening polypeptide in an Ni um /C cyt 
fashion (i.e. its N- and C-termini facing the luminal and cytoplasmic 
sides of membranes, respectively). Such a sequence (e.g. a proline- 
kinked amphipathic semihydrophobic TMp hinge region within the 
Neh6L domain) was deduced from bioinformatic analysis (Fig. SI), 
and those data obtained from glycosylation mapping, and mutagen- 
esis mapping linked to membrane proteinase protection assays 
(unpublished). 

Protease digestion of DsRed/Nrfl/GFP and DsRed/Nrfl AC /GFP 
also generated a GFP-fused polypeptide of either 52-kDa (which 
includes a large C-terminal portion of Nrfl situated between the 
putative TMp and TMc peptides, Fig. 5d2, left) or 49-kDa (which 
contains the C-terminal portion of Nrfl but lacks the TMc peptide, 
Fig. 5d2, right). Intriguingly, the 52-kDa fragment was more sensitive 
to PK and further processed to yield a TX-resistant 32-kDa TMc- 
GFP facing the lumen. Similarly, two polypeptides of 50-kDa and 32- 
kDa were also observed in PK digests of the Nrfl/GFP chimeras (in 
which the wild-type Nrfl is fused C-terminally with GFP, Fig. 5el). 
By contrast, the 49-kDa fragment lacking the TMc segment appeared 
more resistant to digestion by PK, even in the presence of TX 
(Fig. 5d2, right). Moreover, our previous work showed that protei- 
nase digestion of endogenous 120-kDa Nrfl for 30 min using ER 
membranes yielded peptides of 55, 46, 36 and 32 kDa 39 , though most 
of these disappeared upon incubation with PK for 60 min 2,40 . Taken 
together, these results suggest a possibility that a substantial C- 
terminal portion of Nrfl (with an estimated mass of 52 ~ 55 kDa) 
situated between the TMp and TMc peptides is initially translocated 
into the ER lumen, and subsequently some basic-rich regions adjoin- 
ing TMp may be partially repartitioned out of membrane into the 
cyto/nucloplasmic side where they are proteolytically processed into 
several short peptides (for the proposed models, see Fig. S3). By 
contrast, the TMc-adjoining region in Nrfl appears to be integrated 
in a potent N cyt /Q um orientation spanning the TX-resistant mem- 
branes with its C-terminus facing the ER lumen (Fig. S3), but it is 
dynamically moved out of membranes into the cytoplasmic side, so 
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Figure 5 | The membrane-topological folding of Nrfl requires TMI to cooperate with other semihydrophobic regions, (a) Structural diagrams 
of five chimeras indicated. Nrfl AC lacks TMc-adjoining 56 amino acids of Nrfl, whereas, its N65 contains the TMI -associated NHB1 sequence. The 
locations of TMI and the other three possible membrane-associated regions TMi, TMp and TMc are indicated. It is to note that attachment of DsRed 
immediately to the N- terminal end of NHB1 does not alter the orientation of TMI spanning membranes. (b,c) COS-1 cells expressing DsRed/N65/GFP 
(b) and N65/GFP (c) were subjected to subcellular fractionation in order to purify the intact ER fraction. Subsequently, equal amounts (50 ug of protein 
in 40 ul of 1 X isotonic buffer) of ER were examined by membrane proteinase protection reactions that were carried out by incubating (on ice) each of the 
subcellular fractions with 50 ug/ml ( + ) or 100 ug/ml ( + + ) of proteinase K (PK) in the presence ( + ) or absence ( — ) of 1 % Triton X- 100 (TX) for 30 min 
(b) or for the indicated times (c). The reaction products were resolved in a 4-12% LDS/NuPAGE Tris-Bis gel and identified by western blotting with 
antibodies against DsRed (b, upper), GFP (b, middle; c, upper) and CRT. The results reveal that the TMI -associated NHB1 sequence enables N65 to anchor 
in an N cyt /Ci um oriention spanning membranes (left two cartoons), with DsRed and/or GFP portions being positioned in the cytoplasmic and luminal 
sides, respectively. (d,e) Following classic membrane proteinase protection assays of ER-enriched fractions expressing DsRed/Nrfl/GFP (d), DsRed/ 
Nrfl AC /GFP (d) or Nrfl/GFP (e) were carried out as described above. The reaction products were visualized by western blotting with antibodies against 
DsRed (d, upper), GFP (d, middle; e, upper) and CRT. Quantitative analysis of Nrfl/GFP and its fragments (Fig. S5A) revealed that —78% of Nrfl/GFP 
protein was digested by PK for 15 min, whereas the remaining 22% of this protein was further digested to yield two polypeptides of 52-kDa and 32-kDa 
with their C-terminus being fused with GFP. To explain the results obtained from PK digestion of DsRed/Nrfl/GFP and DsRed/Nrfl AC 7GFP, two 
different models were proposed in Fig. S4. 
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that it within the major fraction of Nrfl/GFP is digested to disappear 
in the membrane protection reactions (Figs. 5el and S5A). 

The membrane-topogenic vectorial behaviour of Nrfl controls 
the ability of its TADs to transactivate Gal4-UAS reporter gene 
expression. The above data appear consistent with the notion that 
some regions adjacent to the TM1, TMp and/or TMc peptides may 
control the topological folding of Nrfl and its partial repartitioning 
out of membranes to regulate target gene expression. To test this 
possibility, we combined Gal4-based reporter assays linked with 
protease protection assays (called GRAPPA) to examine whether 
membrane-related vectorial behaviour of Nrfl has an effect on its 
activity to transactivate target gene expression. In these experiments, 
the Gal4 DNA-binding domain (Gal4D) was attached to the N- 
terminus of either wild-type Nrfl or those C-terminally truncated 
Nrfl mutants (Fig. 6a). According to the positive-inside rule (i.e. net 
positively-charged residues adjacent to transmembrane regions are 
positioned on the cyto/nucleoplasmic side of membranes) 68 70 , the 
basic Gal4D portion of 16.8 kDa (which is much smaller than the 
more basic N-terminal 50-kDa portion of ATF6 or SREBP-1, Fig. 
S2B) should not alter the N cyt /Ci um orientation of TM1 within Nrfl, 
so that it may be positioned on the cyto/nucleoplasmic side of 
membranes (Fig. S4A). The resulting ability of various Gal4D/Nrfl 
fusion proteins to transactivate a Gal4D-UAS reporter gene was used 
as a measure of the membrane-topogenic vectorial behaviours of 
Nrfl that may dictate the positioning of its TADs and/or subse- 
quent repartitioning out of membranes to mediate target gene 
expression (Fig. S4B). In this assay, we found that Gal4D/Nrfl-V5 
produced a 360-fold increase in reporter gene activity (Fig. 6a, right), 
even though it was transiently located in the ER 36 . Membrane 
proteinase protection assays on the intact ER fractions showed that 
the full-length Gal4D/Nrfl-V5 and its 28-kDa fragment (which 
comprises about 205 aa possibly located between residues 535 and 
741) were sensitive to PK (Fig. 6, bl & b2), with an exception that a 
residual amount of the 28-kDa fragment with the TMc peptide facing 
the lumen was partially protected by membranes against PK diges- 
tion even in the presence of 1% TX (Fig. 6b2). This suggests that the 
V5-tagged C-terminus of Nrfl might be dynamically partitioned on 
either the cytoplasmic or luminal sides of membranes to extents 
varying with different folding fractions (Fig. S4A). This assump- 
tion is supported by the evidence obtained from the Nrfl/GFP and 
DsRed/Nrfl/GFP chimaeric proteins (Fig. 5, d & e). In addition, 
modest digestion of Gal4D/Nrfl-V5 by 50 ug/ml of PK gave rise 
to three non-tagged polypeptides of 65, 18 and 6 kDa (which 
comprise approximately 510, 140 and 45 aa, respectively) (Fig. 6, 
b3 & b4). They were visualized by cross-reacting with antibodies 
against Nrflfj (this antigen comprises residues 291-741 of Nrfl), 
but largely disappeared following enhanced digestion by 100 ug/ml 
PK. These suggest that all three polypeptides are not sufficiently 
protected by membranes from protease attack, as they may be 
dynamically repositioned into the cyto/nucleoplamic side out of 
membranes. 

We next used the Gal4-based assay to determine whether some of 
the N-terminal regions (e.g. TM1) of Nrfl are integrated within 
membranes. The results revealed that residues 1-181 of Nrfl were 
embedded in the Gal4D/N 181 - V5 chimaeric factor that lacked trans- 
activation activity, suggesting that its NTD (aa 1-124) completely 
repressed the transactivation potential of the N-terminal one-third 
portion (aa 125-181) of AD1 (Fig. 6a). Similarly, although the N298 
polypeptide contains the entire AD1 region (aa 125-298), the Gal4D/ 
N298-V5 chimaeric factor exhibited only a quarter of transactivation 
activity exhibited by wild-type Gal4D/Nrfl-V5 (Fig. 6a, right). 
However, deletion of the N-terminal 80 residues (including the 
TM1 -associated NHB1 sequence) from either N181 orN298 (to yield 
Gal4D/ AN80 N181-V5 and Gal4D/ ANSO N298-V5, respectively), allowed 
their transactivation activities to be rescued from inhibition by the 



NTD of Nrfl. This appears to be consistent with those results 
obtained from mutagenesis mapping of the NTD 38 and membrane 
protease protection assays of both DsRed/N65/GFP and N65/GFP 
fusion proteins (Fig. 5, b & c). Together, these results revealed that 
the TMl-associated sequence (in particular aa 15-24 of Nrfl, see ref. 
38) is essential for integration within membranes. 

Further proteinase protection assays were used to examine 
whether the remaining region of NTD that connects with AD1 is 
either partitioned into the lumen or repartitioned out of membranes. 
Exposure of intact ER-enriched membrane fractions expressing 
Gal4D/N298-V5 or Gal4D/N181-V5 to PK for 30 min yielded a 
fraction of similar V5-tagged polypeptide ladders in the ER lumen 
(Fig. 6, c to f). Digestion of Gal4D/N298-V5 by PK yielded a 4.5-kDa 
peptide that was estimated to contain approximately 25 aa derived 
from Nrfl, suggesting that the Neh5L region resides transiently in the 
lumen (Fig. 6c2). By contrast, PK digestion of Gal4D/N181-V5 
yielded peptides of 14-kDa and 18-kDa that were estimated to con- 
tain roughly 95 and 130 aa derived from Nrfl, respectively (Fig. 6d2), 
but similar PK-digested peptides were not observed in the cases of 
DsRed/N65/GFP and N65/GFP (Fig. 5, b & c). Together with other 
observations by Steffen et al 22 and Tsuchiya et al 48 , these results 
suggest that an N-terminal region possibly located between residues 
55 and 85 may be dynamically repositioned on the cytoplasmic side 
of ER, where it enables Nrfl to be targeted for a proteolytic degra- 
dation pathway. As shown in Figure 6 (c to f), a residual amount of 
other longer polypeptides of between 25-kDa and 58-kDa, which are 
predicted to contain the entire N181 or N298 fused with various 
lengths of Gal4D (only the full-length Gal4D can elicit its DNA- 
binding activity but short degraded peptides lack the activity) were 
observed in 30-min PK digests, suggesting that during initial topo- 
genesis, approximately a quarter of the full-length Gal4D/N181-V5 
or Gal4D/N298-V5 is not tightly anchored by TM1 within mem- 
branes and thus may be released into the lumen, whilst other three- 
fourths of the fusion proteins containing V5-tagged AD1 may have 
been partially repartitioned out of membranes into the cyto/nucleo- 
plasmic side, where they are proteolytically digested to yield multiple 
peptide species. The residual N181- and/or N298-containing frag- 
ments were also further digested following incubation with PK for 
60 min (Fig. S4C), suggesting that the luminal- resident AD1 is cap- 
able of being repartitioned across membranes into the cyto/nucleo- 
plasmic side, enabling transactivation of UAS-Gal4 reporter gene 
(Fig. 6a). The vectorial behaviour of AD1 was also examined by both 
engineered glycosylation mapping of this domain and membrane 
protease protection assays of DsRed/N275/GFP (data not shown 
herein). 

During de novo synthesis, the orientation of TM1 within the NTD 
of Nrfl causes subsequently translated portions to be directed to the 
lumen of the ER. We therefore sought to establish whether the TMi- 
and/or TMp-containing regions (Fig. SI) in Nrfl alter the topological 
distribution of this protein by comparing Gal4D/N181-V5 and 
Gal4D/N298-V5 with Gal4D/N395-V5 (which contains the TMi 
peptide on the C-terminal side of the NST glycodomain), Gal4D/ 
N460-V5 (which contains TMi and AD2), Gal4D/N526-V5 (which 
contains TMi, AD2, and the TMp peptide on the C-terminal border 
of the SR/PEST2 sequence), or Gal4D/N607-V5 (which contains 
TMi, AD2, TMp, SR/PEST2 and Neh6L). As shown in Figure 6 (g 
& h), the results revealed that the N395-, N460-, N526- and N607- 
containing fusion proteins were substantially more sensitive to PK in 
the membrane proteinase protection assays than the N181- and 
N298-containing proteins. When compared with Gal4D/N395-V5 
and Gal4D/N460-V5, the Gal4D/N526-V5 fusion protein was rela- 
tively resistant to digestion, whereas Gal4D/N607-V5 was more or at 
least as sensitive to digestion than others tested. These data suggest 
that the membrane-topological folding of Nrfl needs TMI to coop- 
erate with other regions such as TMi and TMp, and in turn the 
positioning of these regions is monitored by their flanking domains, 
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Figure 6 | Membrane-topogenic vectorial behaviour of Nrfl controls its ability to transactivate Gal4-UAS reporter gene, (a) Schematic structure of 
Gal4D fusion proteins with Nrfl or its truncated mutants used in reporter assays. The locations of TMI, TMi, TMp and TMc are indicated. The AD1, 
NST, AD2 and/or SR regions in Nrfl serve as TADs. The transcriptional activity of Gal4D/Nrfl chimeric factors is shown (right). Luciferase activity was 
measured in lysates of COS-1 cells expressing each Gal4D/Nrfl factor, P TK UAS/Gal4-Luc and P-gal plasmid. The data were calculated as a fold change 
(mean ± S.D, n = 9) over the Gal4D control after normalization to the P-gal value. Statistical significance was determined by the Student t-test or 
MANOVA. When compared with those obtained from Gal4D/Nrfl-V5, significant increases are shown ($, p < 0.05; $$, p < 0.001; n = 9), as are 
significant decreases (*, p < 0.05; **, p < 0.001; n = 9). (b) Cells expressing Gal4D/Nrfl-V5 were subjected to subcellular fractionation and membrane 
proteinase protection reactions for 30 min with either 50 ug/ml ( + ) or 100 ug/ml ( + + ) of proteinase K (PK) in the presence ( + ) or absence ( — ) of 1% 
(v/v) Triton X-100 (TX). The reaction products were examined by immunoblotting with antibodies against the V5 tag (bl, b2) or Nrfip {b3, b4). 
Calreticulin (CRT) served as an ER luminal-resident protein marker. The intensity of cross-reacting bands in some blots was calculated by the ImageJ 
programme, (c to f) The intact ER membrane PK protection assays ofGal4D/N298-V5 (c, e) andGal4D/N181-V5(d, f) from 30 min digestion by PK. The 
electrophoretic mobilities of these two full-length protein and their PK-digested polypeptides were analyzed by 4-12% LDS/NuPAGE gels. Subsequently, 
the intensity of immunoblots for multiple V5-tagged polypeptides was calculated after normalization to the corresponding value of CRT in each well. 
Their mean values obtained from three independent experiments are shown graphically (e and f , n = 3). (g,h) The intact ER membrane proteinase 
protection reactions of Gal4D/N395-V5, Gal4D/N460-V5, Gal4D/N526-V5 or Gal4D/N607-V5 over 30 min were performed as above. The intensity of 
immunoblots with the V5 tag was calculated after normalization with the value for CRT. 
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though the detailed molecular basis requires for further studies. 
Further comparison of their abilities to transactivate the Gal4-UAS 
reporter gene (Figs. 6a, and S4B) revealed that the TMi-adjoining 
AD2 (and SR) positively regulates Nrfl, whilst the TMp-adjoining 
Neh6L domain negatively regulates this factor. By comparison with 
the wild-type Nrfl glycoprotein of 120-kDa, relatively small amounts 
of the various NST-containing Gal4D fusion proteins were glycosy- 
lated (Fig. S4D), suggesting that the NST glycodomain in the basic 
Gal4 fusion protein may be repartitioned out of membranes where it 
is deglycosylated. 

Discussion 

For reasons that are largely historical or involve difficulty of experi- 
mentation, the vast majority of researchers in the field have been 
almost entirely fixated on the water-soluble Nrf2 (to the tune of 
hundreds of publications per year) so far, whereas the membrane- 
bound Nrfl appears to be comparatively ignored 1,2 , even though the 
latter is actually essential for regulating critical homeostatic and 
developmental genes. Now, Nrfl seems to be coming more into its 
own lately, particularly with the demonstration that it is a critical 
regulator of 26S proteasomal subunit expression under conditions of 
proteasomal challenge (i.e. with chemotherapeutic drugs) 22 25 . Our 
past work showed that much of the cellular Nrfl (but not Nrf2), with 
several domains being more conserved with its ancestral proteins 
CNC and Skn-1, is located in the ER but neither in the Golgi nor 
mitochondria 36,38-39 , making it an intriguing question how Nrfl might 
be regulated and what its main functions are. Our present study 
reveals a hitherto unknown intrinsic mechanism by which the 
membrane-topogenic behaviour of Nrfl controls both its post- 
translational modification and functional activity. This addresses a 
significant topic on how Nrfl is allowed to move in and out of ER 
membranes, and/or release away from the ER in a regulated fashion 
into the nucleus, enabling transactivation of target genes; thus it is a 
membrane-spanning protein that is dynamically moving with chan- 
ging topology. The order of membrane-topogenic events is defined, 
whereby the newly-translated Nrfl is translocated into the ER lumen 
where it is glycosylated through its NST domain, and thereafter the 
glycoprotein is repartitioned and retrotranslocated across mem- 
branes into the cyto/nucloplamic compartments where it is deglyco- 
sylated, enabling transactivation of Nrfl -target genes before this 
deglycoprotein is proteolyticalfy degraded by the proteasome. 
Herein, we have also provided evidence that normal glucose supply 
and glucose deprivation alter post-translational modification of Nrfl 
(i.e. N-linked glycosylation and deglycosylation, as well as possibly 
proteolysis mediated by proteasome and/or calpain). 

Importantly, the distinct biological functions of the membrane- 
bound Nrfl and the water-soluble Nrf2 are determined by differ- 
ences in their primary structures. Since Nrfl is essential for survival, 
it must exclusively regulate certain genes that are critical for cellular 
homeostasis. As such, accumulating evidence indicates that Nrfl is a 
dynamic membrane-spanning protein that exists with multiple topo- 
logical isoforms (called topoforms), each with a distinct function. 
According to current knowledge of the folding of proteins within 
membranes 6870 , the membrane-topology of Nrfl is likely to be deter- 
mined by hydrophobic, semihydrophobic, amphipathic, and other 
topogenic signals (collectively designated topogon) within the 
protein. The topogon is co-translationally decoded by molecular 
machines (e.g. translocon) into an initial topological struc- 
ture partitioned within and around the specific membrane lipid 
microdomain. During subsequent folding of Nrfl, potential 
intramolecular interactions between its own helical regions and 
intermolecular interactions of its helical regions with those in other 
membrane proteins (e.g. retro translocon or flippase) and/or with 
membrane lipids dictate the formation of the final topological 
structure and position of the NHB1-CNC factor within and around 
membranes. 



To be explicit, we propose a model for the membrane-topogenic 
vectorial process of Nrfl (Figs. 7 and S4A), in which Nrfl can adopt 
distinct dynamic membrane-topologies that are determined by its N- 
terminal TM1 sequence (aa 7-26), which itself exists in an N cyt /Ci um 
orientation within membranes. Our previous mutagenesis mapping 
data 38 revealed that, within the TMl-associated NHB1 sequence, the 
core hydrophobic residues 15-24 are essential for anchoring Nrfl 
within ER membranes so that it adopts an N cyt /Ci um orientation, 
whereas its C-terminal flanking aa 38-44 contain a potential REIIL 
motif, but do not serve as a cleavage site by Site-1 protease. 
Otherwise, loss of the TM1 peptide resulted in production of an 
unstable peripheral membrane-associated or free nuclear mutant 
proteins 38 40 . Nevertheless, the TM1 peptide is still unlikely to fold 
tightly within ER membranes because it is not highly hydrophobic 
(Fig. SI), and this characteristic will allow a small fraction of Nrfl to 
be released into the extra-membranous cytoplamic compartments 
(in which it is digested in membrane proteases protection reactions, 
Fig. 6) or luminal compartments (in which it is protected by mem- 
branes against protease digestion). In reality, the luminal portion of 
Nrfl may also be refolded and thus become reintegrated into the ER 
membrane, in which it would seem to be susceptible to a protease 
attack. 

During biosynthesis of Nrfl, it is anchored by TM1 within the ER 
membrane and the remaining part of the factor is continuously 
translocated into the lumen, where its NST domain is glycosylated 
to yield a polypeptide of 120 kDa. Within the NST domain of Nrfl, 
the glycosylated asparagines that flank the TMi peptide allow the 
protein to become tethered to the luminal leaflet of the membrane 
lipid bilayer, though the TMi sequence may fold into a glycine- 
kinked amphipathic semihydrophobic helix structure spanning 
membranes. This assumption is consistent with bioinformatics ana- 
lysis (Fig. SIB) and experimental evidence provided herein. When 
the TMi sequence is liberated from confinement caused by glycosy- 
lation, it is likely to be reintegrated within membranes and therefore 
its topological orientation seems to be determined by its flanking 
deglycosylated regions. This vectorial process, though it has been 
not sufficiently explored, may allow repartitioning of the AD1 and 
NST domains such that they can adopt a transient membrane- 
spanning topology. It is also possible that the liberated TMi sequence 
promotes repartitioning of the AD2 and SR domains out of mem- 
branes, particularly in response to glucose deprivation. This notion is 
also supported by data showing that mutation of TMi-adjoining 
asparagines to prevent glycosylation increases the activity of Nrfl. 
It is therefore postulated that the TMi peptide could act as a flexible 
hinge switch to control the repartitioning of AD1 and AD2 out of 
membranes, enabling them to engage with the transcriptional 
machinery and induce Nrfl -target gene expression. In turn, topo- 
logical regulation of gene transactivation may also be differentially 
modulated by TMi-flanking AD1 and AD2. This is supported by the 
findings that loss of both TMi-containing NST domain and AD2 (in 
Gal4D/N298-V5 retaining AD 1 ) or AD2 (in Gal4D/N395- V5 retain- 
ing AD1 and NST) caused the reporter gene activity to be reduced to 
approximately 25% or 55% of that of the full-length Gal4D/Nrfl-V5 
factor, respectively (Fig. 6). 

The membrane lipid electrochemical potential of the ER, that gives 
a relatively positive charge on the outside (i.e. the luminal side of 
membranes), promotes translocation of the acidic TADs with net 
negative charges into the ER lumen and also allows retention of the 
net positively charged basic Neh6L and CNC-bZIP domains on the 
cytoplamic side of the ER. This is supported by the data obtained 
from membrane proteinase protection assay, revealing that loss of 
some portions of the Neh6L and bZIP domains causes the resulting 
mutant proteins to be trapped in the ER lumen. Intriguingly, we 
found that it is possible for the TMc sequence to adopt the same 
N cyt /Ci um orientation as TMI (N cyt /Ci um ); this is consistent with the 
positive-inside, charge difference rules, and the hydrophobic 
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Figure 7 | A proposed model to explain the membrane-topogenic behaviour of Nrfl in vectorial processes. After ER targeting, Nrfl is anchored in the 
membrane through TM1. AD1 and NST-adjoining domains are transiently translocated into the ER lumen, where Nrfl is glycosylated through the NST 
domain to become a 120-kDa glycoprotein. During initial topogenesis of Nrfl, the TMi glycopeptide and its adjacent AD2 N-terminal amphipathic 
portion are probably tethered to the luminal leaflet of the membrane lipid bilayer, whilst the TMp peptide may be dynamically associated with 
membranes. Once TMp is orientated in an Ni um /C cyt fashion within membranes, its flanking PEST2 and Neh6L regions are partitioned into the luminal 
and cyto/nucleoplasmic sides, respectively. During this stage, the basic CNC-bZIP domain is retained in the cyto/nucloplasmic sides of membranes, and 
its connecting TMc is likely to be either left in the cytoplasm or integrated into membranes with its C-terminus facing the lumen. If TMp is not integrated 
within membranes, its C-terminally flanking regions might also release into the lumen, until they are repartitioned out of membranes. During topogenesis 
of Nrfl, once TMi is liberated from the restraint of its flanking glycopeptide, it may be reintegrated with membranes. This process would enable the 
repartitioning of AD2 and SRout of membranes, allowing them to act as a bona fideT AD. When appropriate, the luminal AD1 and NST glycodomain are 
partially repartitioned across membranes into the cyto/nucleoplasm, enabling Nrfl to be deglycosylated to produce a 95-kDa activator. Although the 
NHB1 -associated sequence is not processed via regulated intramembrane proteolysis accounting for ATF6 and SREBP-1 (ref. 38 and herein), other 
regions within NTD and/or AD1 might be allowed for proteolytic processing to move the NTD possibly through the Hrdl- and VCP/p97-dependent 
ERAD 22,23,48 and/or PEST 1 -signalling pathways 57 , in order to release an 85-kDa cleaved isoform of Nrfl from the ER/nuclear membranes into the nucleus 
to transactivate ARE-driven gene expression. Lastly, several potential degrons can trigger proteolytic degradation of Nrfl to yield 55-kDa Nrfl fJ/LCR-Fl 
(a weak activator), and the dominant-negative 36-kDa Nrfly and 25-kDa NrflS isoforms. In addition, they can also arise from internal in-frame 
translation. 



gradient along membranes 68,69 . Our findings also suggest that the 
TMc peptide may be dynamically moving in and out of membrane, 
even though it is likely to be integrated in the TX- resistant membrane 
microdomains. Once TMc is integrated in an N cyt /Q um orientation 
spanning membranes, on such occasions the topological organiza- 
tion of Nrfl may force the semihydrophobic amphipathic TMp 
hinge region to span the membrane by folding into an Ni um /Ccyt 
orientation (Figs. 7, SI and S4). Only in the presence of TMI, both 
TMc and TMp can contribute to negative regulation of Nrfl through 
association with membranes under normal homeostatic condi- 
tions 2,39 . In addition, the negative regulation of Nrfl by TMp is also 
supported by the finding that the TMp-truncated Gal4D/N460-V5 
fusion factor exhibited approximately 1.5-fold greater transactiva- 
tion activity than that of the TMp-containing Gal4D/N526- V5 factor 
(Fig. 6). These observations raise the possibility that, after Nrfl is 
anchored by TMI within membranes, either TMc or TMp may adopt 
a co-spanning topology and move dynamically in and out of mem- 
branes (Figs. S2 to S4). This topogenic-vectorial process is likely to 
depend on their intramolecular interactions with TMI through 
small, polar and charged residues (refs. 11,60, and this study). The 
topogenic process may also rely on intermolecular interactions of 
their adjoining CRAC regions with membrane lipids, such as cho- 
lesterol and sphingolipids that are enriched in the detergent-resident 
membrane microdomain 71,72 , but this possibility warrants further 



study. Overall, it appears that the topological organization of Nrfl 
within membranes is highly dynamic (in particular TMp and TMc) 
and is also influenced by changes in the membrane lipids and the 
environment. 

When required, acidic-hydrophobic amphipathic TADs and/or 
other regions of Nrfl are repositioned from the ER lumen into the 
cyto/nucleoplasmic side, where it is allowed for deglycosylation to 
yield a 95-kDa activator and/or for selective proteolysis to generate 
multiple short species with masses of 85-kDa (as a cleaved activator), 
55-kDa Nrfl (3 (as a weak activator), 36 kDa Nrfly and 25 kDaNrfl5 
(the latter two act as dominant-negative isoforms) (Figs, la & S5C); 
they will adopt distinct membrane-topological conformations, lo- 
cated in different subcellular compartments (Fig. S5B). Some of these 
short isoforms may be generated through internal translation from 
the in-frame ATG codons and/or selective proteolysis of longer Nrfl 
proteins by proteosome and/or calpain (this study, see refs. 1 1,39,60). 
Those newly-translated short Nrfl species lacking the TMI deter- 
minant, though they contain other semihydrophobic and amphi- 
pathic sequences (e.g. TMp and/or TMc), cannot be targeted to the 
ER so that they are thus positioned in the cyto/nucleoplasmic com- 
partments (Fig. S5B). Alternatively, other short species that arise 
from post-translational proteolysis of the ER-bound full-length 
Nrfl protein may be positioned in close proximity to membranes; 
this is attributable to a potential cooperation of the semihydrophobic 
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amphipathic regions with TM1 for the overall membrane- 
topological folding of Nrfl before it is proteolytically degraded. 
Once these amphipathic regions would disassociate from TM1, the 
corresponding Nrfl species will be eventually released from mem- 
branes, so that they could gain a distinct membrane-topology. 
Therefore, the use of microscopy following in vivo proteinase protec- 
tion reactions is hard to determine dynamic membrane- topologies of 
distinct Nrfl isoforms by distinguishing their superimposed images. 

In summary, an intrinsic molecular cell mechanism by which 
the membrane- topogenic behaviour of Nrfl controls its post- 
translational modification and transcriptional regulation of target 
genes has been discovered in the present study together with our 
recent work. During translation, Nrfl is targeted to the ER through 
itsNHBl signal peptide (Fig. 7). TheTMl sequence that is embedded 
within the uncleavable NHB1 enables Nrfl to anchor in membranes 
and dictates its initial membrane topology. However, the final topo- 
logical folding of Nrfl requires TM1 to cooperate with semihydro- 
phobic amphipathic TMi, TMp and TMc regions situated in distinct 
domains of the transcription factor. After Nrfl is orientated within 
the ER membrane, its DNA-binding domain (DBD) is partitioned to 
reside on the cytoplasmic side of the membrane, whilst its acidic 
TADs, including its AD1, AD2 and NST glycodomain, are transi- 
ently translocated into the ER lumen, enabling glycosylation of Nrfl 
to create an inactive 120-kDa glycoprotein in the presence of glucose. 
When required, some of the TADs are dynamically repartitioned out 
of membranes and/or retrotranslocated into the cyto/nucleoplasm, 
whereupon Nrfl is deglycosylated to become an active 95-kDa tran- 
scription factor, particularly upon glucose deprivation. The TMi- 
adjoining regions (e.g. AD1 and AD2) may modulate the partial 
repartitioning of TADs and their coupled degrons across membranes 
into the cyto/nucleoplasmic side, where Nrfl is likely allowed for 
selective proteolysis to yield an 85-kDa cleaved isoform. By contrast, 
the TMp-flanking regions between TADs and DBD can monitor 
further proteolytic processing of Nrfl to create dominant-negative 
forms in order to shut down target gene expression. Thus, the 
membrane-related vectorial behaviour of Nrfl dictates its glycosyla- 
tion, deglycosylation, selective proteolysis, and possibly its transport 
into the nucleus, which together control its TAD-mediated tran- 
scriptional activity through binding of the DBD/sMaf heterodimer 
to ARE sequences in target genes. Overall, our present work provides 
a platform for moving forward to tease out how Nrfl is regulated 
downstream of and possibly during the post-translational processing 
that is selectively controlled by the membrane- topogenic vectorial 
behaviour of this factor. 

Methods 

Chemicals and antibodies. All chemicals were of the highest quality commercially 
available. The ER extraction kit was purchased from Sigma- Aldrich. The enzymes 
Endo H and PK were obtained from New England Biolabs. Rabbit polyclonal 
antibodies against calreticulin (CRT) and green fluorescent protein (GFP) were 
bought from Calbiochem and Abeam PLC, respectively. A mouse monoclonal 
antibody against the V5 epitope and rabbit polyclonal antibodies against DsRed {a 
Discosoma sp. red fluorescent protein) were from Invitrogen Ltd. Antisera against 
Nrfl that were produced in rabbits using a polypeptide covering amino acids 
292-741. 

Expression constructs. Expression constructs for full-length mouse Nrfl and its 
mutants, along with other expression constructs for N65/GFP and Nrfl/GFP fusion 
proteins, have been created as described previously 36,39 . Three sandwich fusion 
proteins DsRed/Nrfl/GFP, DsRed/Nrf 1 AC / GFP and DsRed/N65/GFP, were 
engineered by inserting nucleotide sequences encoding Nrfl, Nrfl AC (lacking the C- 
terminal 56 amino acids of Nrfl), and N65 (containing the first N-terminal 65 aa of 
Nrfl), respectively, between the cDNAs for DsRed2 and GFP within the pDsRed2- 
GFP vector through the Sall/Kpnl multiple cloning site 39 . A series of Gal4D/Nrfl 
expression constructs were generated by ligating cDNA sequences that encode full- 
length Nrfl, or its N-terminal 181, 298, 395, 460, 526 or 607 amino acids (called N181, 
N298, N395, N460, N526 and N607, respectively), to the 3'-end of cDNA encoding 
the Gal4 DNA-binding domain (called Gal4D), within pcDNA3.1Gal4D-V5, through 
the BamHI/EcoRI cloning site 36 . All oligonucleotide primers were synthesized by 
MWG Biotech, and the fidelity of all cDNA products was confirmed by sequencing. 



Cell culture, transfection, and reporter gene assays. Unless otherwise indicated, 
monkey kidney COS- 1 cells (3 X 10 5 ) were seeded in 6-well plates and grown for 24 h 
in Dulbecco's modified Eagle's medium (DMEM) contain 25 mM glucose and 10% 
FBS. After reaching 70% confluence, the cells were transfected with a Lipofectamine 
2000 (Invitrogen) mixture that contained an expression construct for wild-type Nrfl 
or a mutant protein, together with P sv40 nqo 1-ARE-luc, P SV40 GSTA2-6X ARE-Luc 
(which contains six copies of the core ARE consensus sequence from rat GSTA2, see 
refs. 36,39,73), or P TK UAS/Gal4-Luc 39 , along with pcDNA4/HisMax//acZ encoding 
P-galactosidase ( J3-gal) that was used as a control for transfection efficiency. In 
addition, mutant versions of these reporter genes that lacked the ARE sequence were 
used as negative controls. Luciferase or CAT activity was measured approximately 
36 h after transfection. The basal and stimulated ARE-driven reporter gene activity 
obtained following transfection with an expression vector for Nrfl (or its mutants) 
was calculated as a ratio of its value against the background activity (i.e. the luciferase 
acitivity obtained following co-transfection of an empty pcDNA3.1/V5 His B vector 
and an ARE-driven reporter after subtraction of the non-specific value from 
cotransfecting an empty pcDNA3.1/V5 His B vector and a non-ARE-containing Luc 
plasmid). Subsequently, the basal activity of full-length wild-type Nrfl was given the 
value of 1.0, and other data were calculated as a ratio of this value and are shown as a 
fold change (mean ± S.D). The data presented each represent at least 3 independent 
experiments undertaken on separate occasions that were each performed in triplicate. 
Differences in their transcriptional activity were subjected to statistical analyses. 

Subcellular fractionation and membrane PK protection reactions as combined in 
FPP, dFPP or GRAPPA. To investigate the topological folding of Nrfl within 
membranes and its repartitioning out of membranes, the intact ER-rich membrane 
fractions were prepared from COS-1 cells expressing Nrfl fusion protein in classic 
membrane PK protection assays, as described previously 40 ' 74-77 . The intact ER-rich 
fraction was resuspended in 100 ul of 1 X isotonic buffer [10 mM Hepes, pH 7.8, 
containing 250 mM sucrose, 1 mM EGTA, 1 mM EDTA and 25 mM KC1]. 
Subsequently, membrane proteinase protection reactions were performed for 15, 30 
or 60 min on ice in an aliquot (50 ug of protein) of the membrane-containing 
preparation with PK added to a final concentration of 50 or 100 ug protein/ml in 
either the presence or absence of 1% (v/v) Triton X-100 (TX). The reactions were 
stopped by incubation at 90 C for 10 min following the addition of 1 mM PMSF. 
Subsequently, the reactants were analysed by western blotting with antibodies against 
NrflJ3, or the V5 ectope. The amount of Nrfl protein remaining after PK digestion 
was calculated as described below. Importantly, fluorescence protease protection 
(FPP) assay 63 ' 64 was described to determine the topology of membrane proteins 
(which terminus is facing either the luminal or the cytoplasmic sides of membranes). 
Double fluorescence sandwiched proteins linked to proteinase protection reaction 
was called dFPP herein. Gal4-based reporter assays linked to protease protection 
assays (called GRAPPA) were used as a measure of whether the membrane-topogenic 
vectorial behaviour of Nrfl has an impact on reporter gene expression. 

Effects of cellular glucose concentrations on Nrfl activity and analysis following 
treatment with CHX, TU, Z-VAD-fmk or MG132. COS-1 cells (3 X 10 5 ) were 
cultured for 24 h in DMEM containing 25 mM glucose and 10% FBS. After reaching 
80% confluence, the cells were transfected for 6 h with 5 ul of a Lipofectamine 2000 
mixture containing 1.5 ug of the expression construct for wild-type Nrfl and 0.2 ug 
of pcDNA4/HisMax//acZ. The transfected cells were allowed to recover for 16 h in 
DMEM containing 25 mM glucose (control) or 1.0 mM glucose (starvation). 
Thereafter, the 1.0 mM glucose-starved cells were transferred to medium containing 
25 mM glucose supplemented with vehicle DMSO, TU (1 ug/ml) alone or plus 
MG132 (5 uM). The transfected cells that had been left to recover in medium 
containing 25 mM glucose were subjected to: (i) glucose deprivation, using medium 
that lacked glucose to which was added 0.1% (v/v) DMSO vehicle control, Z-VAD- 
fmk (20 uM) alone or plus MG132 (5 uM); (ii) hypoglucose restriction, using 
medium containing 5.5 mM glucose to which was added TU (1 |J.g/ml) alone or plus 
MG132 (5 uM); (iii) normal glucose supply using medium containing 25 mM 
glucose to which was added CHX (50 ug/ml), TU (1 ug/ml) alone or plus MG132 
(5 uM). The cells were subsequently harvested at the indicated time points after 
manipulation of glucose levels. The expression of Nrfl in the cells treated with 
glucose-free medium or medium containing 5.5 mM glucose or 25 mm glucose was 
examined by western blotting with antibodies against the V5 tag. The intensity of 
blots was quantified using the ImageJ programme developed by NIH, and was then 
used to calculate changes in the abundance of various Nrfl isoforms. 

Deglycosylation reactions and western blotting. The in vitro deglycosylation 
reactions of each sample with 500 units of Endo H were carried out, followed by 
western blotting. On some occasions, nitrocellulose membranes that had already been 
blotted with an antibody were washed for 30 min with stripping buffer [7M guanidine 
hydrochloride, 50 mM glycine, 0.05 mM EDTA, 0.1 M KC1 and 20 mM 2- 
mercaptoethanol at pH 10.8], before being re-probed with an additional primary 
antibody against CRT or |3-Actin; both served as internal controls to verify equal 
loading of protein into each electrophoretic well 78 . The intensity of blots was 
calculated using the Quantity One® software developed at Bio-Rad Laboratories. 

Bioinformatic analyses. The membrane-topology of Nrfl was initially 
predicted using several bioinformatic algorithms, including the TopPred 
(http://mobyle.pasteur.fr/cgi-bin/portal.py?form— toppred), HeliQuest 
(http://heliquest.ipmc.cnrs.fr/) and AmphipaSeek (http://npsa-pbil.ibcp.fr/cgi-bin/ 
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npsa_automat.pl?page = /NPSA/npsa_amphipaseek.html) programmes. The 
predicted topologies were evaluated using molecular and biochemical experiments. 

Statistical analysis. The statistical significance of changes in Nrfl activity and in the 
intensity of immunoblots, was determined using the Student's t test or Multiple 
Analysis of Variations (MANOVA). The data presented herein are shown as a fold 
change (mean ± S.D), each of which represents at least 3 independent experiments 
undertaken on separate occasions that were each performed in triplicate 
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